Semantic equivalences are used in process algebra to capture the notion of similar behaviour, and
this paper proposes a semi-quantitative equivalence for a stochastic process algebra developed for
biological modelling. We consider abstracting away from fast reactions as suggested by the
Quasi-Steady-State Assumption. We define a fast-slow bisimilarity based on this idea. We also show
congruence under an appropriate condition for the cooperation operator of Bio-PEPA. The condition
requires that there is no synchronisation over fast actions, and this distinguishes fast-slow bisimilarity
from weak bisimilarity. We also show congruence for an operator which extends the reactions
available for a species. We characterise models for which it is only necessary to consider the matching of
slow transitions and we illustrate the equivalence on two models of competitive inhibition.

1 Introduction 

One of the features of process algebra is behavioural or semantic equivalence [29, 24] which determines
if two processes act the same way. Furthermore, congruence is of interest where the behaviours of two 
equivalent systems are indistinguishable within any context, in the sense of combining systems using one 
or more operators of the process algebra. Notions of equivalence can also be useful in systems biology.
For example, equivalences can be used to compare the behaviour of two systems or parts of them, and 
to show the consistency between different abstractions of the same system. Furthermore, if congruence 
holds then it may be possible to replace a (part of the) system with a smaller equivalent component 
without changing the behaviour, and thus reduce the state space. 

We focus on Bio-PEPA [14], a process algebra defined for modelling and analysis of biochemical
networks. Bio-PEPA models have a number of interpretations, including ordinary differential equations 
(ODEs) and stochastic simulation Ell . We develop our equivalence in the context of mapping a Bio- 
PEPA model to a finite transition system (which can be interpreted as a finite continuous -time Markov 
chain (CTMC)). Such models are called Bio-PEPA models with levels since we assume a maximum
quantity for each species, and we stratify molecule counts or discretise concentrations into levels, resulting in
a finite and tractable transition system, thus ameliorating the state space explosion problem. 

Here we apply a traditional technique of process algebras, namely semantic equivalence to Bio-PEPA 
with levels. Isomorphism and strong equivalence (adapted from PEPA [24]) have both been defined for 
Bio-PEPA [14, 19], but both relations are very strong notions of equivalence and not able to capture
biological behaviour of interest. 

By contrast, our approach is semi-quantitative in that we consider relative rather than actual speeds of 
reactions, and use this to determine whether two models have similar behaviour by abstraction from the 
faster reactions. The more abstract model has fewer species and hence fewer parameters. This reduced 
model can be parameterised more straightforwardly when there is limited experimental data. 
We define fast-slow bisimilarity for Bio-PEPA inspired by biology. The motivation comes from the
Quasi-Steady-State Assumption (QSSA) [36] which may reduce systems of ODEs where there are large 
differences in reaction rates. This is achieved by abstracting from fast reactions, by assuming almost no 
change in the amount of intermediate products (effectively assuming a steady state for these products), 
obtaining fewer reactions and a smaller system of ODEs. The rates of reactions in the reduced system 
are no longer based on mass action but are defined in a more complex manner which is determined in the
derivation of the reduced system of ODEs. It is then possible to work with the reduced system as a model 
thereby requiring fitting of fewer parameters. 

As with any semantic equivalence we investigate congruence. Fast-slow bisimilarity is shown to 
be a congruence with respect to cooperation under the condition that there are no fast actions that can 
occur between pairs of components. This restriction for congruence is not a significant limitation as 
it has been shown that introducing other fast reactions (beyond those that are considered by QSSA) 
involving the species to which QSSA is applied can drastically reduce the accuracy of the QSSA [35].
Fast-slow bisimilarity is similar in definition to an existing equivalence called weak bisimilarity where 
the behaviour of silent or invisible $\tau$ actions is abstracted away. However, the additional condition needed
to show congruence distinguishes them. Using an existing technique [22], we are able to show that for
certain reduced models, it is possible to work with a definition of bisimilarity which only considers slow 
reactions. 

The rest of the paper is structured as follows. An introduction to QSSA follows, then Bio-PEPA is 
introduced and the definition of fast-slow equivalence is proposed. Congruence is proved for cooperation 
and the extension operator. Slow bisimulation is defined and conditions identified for which this is 
sufficient, followed by an example to illustrate our equivalence. Finally, related and further work is
discussed and concluding remarks are given. 

2 Quasi-Steady-State Assumption 

In this section, we consider an existing approach to model reduction which reduces the number of species 
to be considered and determines new reaction rates. First, consider a set of non-oscillating reactions. We 
can identify a set of reactants that are present before the reactions start, say $\Sigma$, and a set of products $\Psi$ that
are present once all reactions have completed. However, complexes may be created during the reactions
- these are called intermediate species, and we use $T$ for the set of these species. We will also call the
species in $\Phi = \Sigma \cup \Psi$ non-intermediate. Note that $T \cap \Phi = \emptyset$ but we cannot assume that $\Sigma \cap \Psi$ is empty
since modifiers such as enzymes may appear in both.

In cellular systems, biochemical reactions can happen on very different time scales. There can be 
very frequent reactions (fast reactions) and less frequent reactions (slow reactions). In this case, we can 
apply the Quasi-Steady-State assumption [36] which is a time scale separation approach. We discuss
other time scale separation/decomposition techniques in Section 8.

If fast reactions lead to the production of intermediate species, then the instantaneous rates of change 
of the intermediate species in the reaction are approximately equal to zero, with respect to the slow 
reactions, so they can be viewed as being at steady state. The pre-steady-state transient period before 
this happens is much shorter compared to the time taken for slower reactions, and since this period is 
typically of less interest, inaccuracies are less important. 

Specifically, we assume species $X_i$, $i = 1, \ldots, n$ with $T = \{X_{j_1}, \ldots, X_{j_m}\}$ the $m$ intermediate species.
The equation$^1$ $dX_{j_k}/dt = f_{j_k}(X)$ (where $f_{j_k}(X)$ stands for a mathematical expression describing the
dynamics of $X_{j_k}$ in terms of all species) is assumed to be approximately equal to zero, and therefore
$f_{j_k}(X) \approx 0$. From these equations, it may be possible to derive an expression for intermediate species in
terms of other species. These expressions can replace the variables representing intermediate species in
the rate equations of other species. The resulting ODE system has fewer equations and terms (reactions).
Therefore, the effect of the application of QSSA is to reduce the model complexity. This kind of
approximation and resulting reduction is useful when there are many species and hence many ODEs; when
systems of ODEs are stiff (they have widely different rates) and hence are difficult to solve numerically
[9]; and when it is difficult experimentally to obtain rate parameters for intermediate species.

1 Here the concentration of the species X is represented by the variable X.

As an example, consider the reactions $S + E \overset{R_1, R_{-1}}{\longleftrightarrow} SE \overset{R_2}{\longrightarrow} P + E$ where $S$ is the substrate, $P$ is the
product, $E$ the enzyme, and $SE$ the intermediate substrate-enzyme compound. The double-headed arrow
represents a reversible reaction, with the forward reaction named $R_1$ and the reverse reaction named
$R_{-1}$. All reactions are described by mass-action kinetics with rate constants $k_1$, $k_{-1}$, $k_2$. These reactions represent the
Michaelis-Menten mechanism for enzymatic catalysis [36, 28]. The corresponding ODE system is

$$dS/dt = -k_1 E \cdot S + k_{-1} SE \qquad dE/dt = -k_1 E \cdot S + k_{-1} SE + k_2 SE$$

$$dP/dt = +k_2 SE \qquad dSE/dt = +k_1 E \cdot S - k_{-1} SE - k_2 SE$$

When the first two reactions are assumed fast and the third slow, the intermediate species $SE$ is considered
to be at steady-state. We can say that $dSE/dt \approx 0$ and from this we obtain $SE = E_T \cdot S / (S + K_M)$, where
$E_T = E + SE$ is the total enzyme in the system and constant, and $K_M = (k_{-1} + k_2)/k_1$ (Michaelis-Menten
constant). Replacing $SE$ with the expression above, we derive the ODE system $-dS/dt = dP/dt =
(k_2 E_T \cdot S)/(S + K_M)$. We now have a single reaction $S + E \longrightarrow P + E$ that abstracts the original
reactions. This simplification is called the Michaelis-Menten (MM) approximation and it is valid under some
assumptions, such as $S_0 + K_M \gg E_T$ where $S_0$ is the initial quantity of $S$ [37].
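The effect of the approximation can be explored numerically. The following is a minimal sketch, not part of the original development: it integrates the full mass-action system and the reduced Michaelis-Menten system with a fixed-step Runge-Kutta scheme and compares the amount of product. The function names and all numerical constants are illustrative assumptions, chosen so that binding and unbinding are much faster than catalysis and $S_0 + K_M \gg E_T$.

```python
def full_rhs(state, k1, k_minus1, k2):
    """Mass-action right-hand side for S + E <-> SE -> P + E."""
    s, e, se, p = state
    bind = k1 * e * s          # R1:  S + E -> SE
    unbind = k_minus1 * se     # R-1: SE -> S + E
    cat = k2 * se              # R2:  SE -> P + E
    return (unbind - bind, unbind + cat - bind, bind - unbind - cat, cat)


def reduced_rhs(state, k2, e_total, k_m):
    """Michaelis-Menten right-hand side for the single reaction S + E -> P + E."""
    s, p = state
    rate = k2 * e_total * s / (s + k_m)
    return (-rate, rate)


def rk4(rhs, state, dt, steps, *params):
    """Classical fixed-step fourth-order Runge-Kutta integration."""
    for _ in range(steps):
        a = rhs(state, *params)
        b = rhs(tuple(x + 0.5 * dt * da for x, da in zip(state, a)), *params)
        c = rhs(tuple(x + 0.5 * dt * db for x, db in zip(state, b)), *params)
        d = rhs(tuple(x + dt * dc for x, dc in zip(state, c)), *params)
        state = tuple(
            x + dt * (da + 2 * db + 2 * dc + dd) / 6
            for x, da, db, dc, dd in zip(state, a, b, c, d)
        )
    return state


# Illustrative (made-up) constants: binding and unbinding fast, catalysis slow.
k1, k_minus1, k2 = 100.0, 100.0, 1.0
s0, e_total = 10.0, 0.1
k_m = (k_minus1 + k2) / k1   # Michaelis-Menten constant

# Integrate both systems up to time 2 with step 1e-4.
full = rk4(full_rhs, (s0, e_total, 0.0, 0.0), 1e-4, 20000, k1, k_minus1, k2)
reduced = rk4(reduced_rhs, (s0, 0.0), 1e-4, 20000, k2, e_total, k_m)

print("P (full model):   ", full[3])
print("P (reduced model):", reduced[1])
```

With constants like these, the two product values should agree closely, while the total enzyme $E + SE$ stays constant in the full model; making $E_T$ comparable to $S_0 + K_M$ would be expected to degrade the agreement.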

This approach provides an analogy for the development of our semantic equivalence which is
presented in the next section. It leads us to partition reactions into fast and slow, and allows us to abstract
away from the fast reactions. We now introduce the process algebra to which we will apply these
concepts.

3 Bio-PEPA with levels 

The syntax of Bio-PEPA with levels [14] is given by the grammar below. S defines sequential components
which describe the behaviour of biochemical species, and P defines model components which combine 
the species components and from which we can understand the interactions between species. 

$$S ::= (\alpha, \kappa)\ \mathrm{op}\ S \mid S + S \mid C \qquad \mathrm{op} ::= \downarrow \mid \uparrow \mid \oplus \mid \ominus \mid \odot \qquad P ::= P \bowtie_L P \mid S(l)$$

In the term $(\alpha, \kappa)\ \mathrm{op}\ S$, $\alpha$ is an action or reaction name from $\mathcal{A}$, $\kappa \in \{1, 2, \ldots\}$ is the stoichiometric
coefficient of the species and the prefix combinator op describes the role of the species in the reaction.
The symbol $\downarrow$ is used for a reactant, $\uparrow$ a product, $\oplus$ an activator, $\ominus$ an inhibitor, and $\odot$ for a generic
modifier. The operator + provides the choice between two sequential components and species constants
are defined by $C \stackrel{\mathrm{def}}{=} S$. The process $P \bowtie_L Q$ denotes the synchronisation between two components $P$ and
$Q$ and the set $L$ specifies those reactions on which the components must synchronise. We use $P \bowtie_* Q$ to
denote the case when all actions shared by $P$ and $Q$ are synchronised on. In the model component $S(l)$,
the parameter $l \in \mathbb{N}$ represents the level of molecular count or concentration. The set of all Bio-PEPA
species components is $\mathcal{S}$ and the set of all Bio-PEPA model components is $\mathcal{C}$.
We consider a constrained set of Bio-PEPA models to ensure well-behaved systems. We require that
a species is a choice between reactions without any repeated actions and that there is only one species 
component for a species at model level, as described by the next definition. 

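Definition 3.1. A Bio-PEPA sequential component $C$ is well-defined if it has the form

$C \stackrel{\mathrm{def}}{=} (\alpha_1, \kappa_1)\ \mathrm{op}_1\ C + \ldots + (\alpha_n, \kappa_n)\ \mathrm{op}_n\ C$, written as $C \stackrel{\mathrm{def}}{=} \sum_{i=1}^{n} (\alpha_i, \kappa_i)\ \mathrm{op}_i\ C$, where $\alpha_i \neq \alpha_j$ for $i \neq j$.

A model component $P$ is well-defined if it has the form $P \stackrel{\mathrm{def}}{=} C_1(l_1) \bowtie_{L_1} \ldots \bowtie_{L_{p-1}} C_p(l_p)$ where each $C_i$ is a
well-defined sequential component, the elements of each $L_j$ appear in $P$ and if $i \neq j$ then $C_i \neq C_j$.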

Additionally, each model has an associated context collecting together information such as rates,
compartments and parameters, as now defined.

Definition 3.2. A well-defined Bio-PEPA system $\mathcal{P}$ is a six-tuple $\langle \mathcal{V}, \mathcal{N}, \mathcal{K}, \mathcal{F}, Comp, P \rangle$, where $\mathcal{V}$ is
the set of compartments, $\mathcal{N}$ is the set of quantities describing each species, $\mathcal{K}$ is the set of parameters,
$\mathcal{F}$ is the set of functional rates, $Comp$ is the set of well-defined sequential components and $P$ is a
well-defined model component. $\mathcal{V}$, $\mathcal{N}$, $\mathcal{K}$, $\mathcal{F}$, $Comp$ are called the context of $P$.

For details of the elements of the context and the definition of well-defined Bio-PEPA system, see [14,
13]. In this paper, we only consider single compartment models$^2$. The levels for a species are obtained
from information contained in $\mathcal{N}$ for that species. More specifically, for each species $C$, we assume
a maximum molecular count$^3$ $M_C$, and a fixed step size $H$ across all species to ensure conservation of
mass during reactions involving multiple species. The maximum level for a species is determined by
$N_C = \lceil M_C / H \rceil$. Thus, $C$ has levels $0, \ldots, N_C$, giving $N_C + 1$ levels in total.
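As a small worked illustration of the discretisation, the sketch below computes the maximum level and the list of levels for one species. The function names and the example values $M_C = 95$ and $H = 10$ are assumptions made only for this illustration.

```python
import math


def max_level(max_count, step_size):
    """Maximum level N_C = ceil(M_C / H) for a species."""
    return math.ceil(max_count / step_size)


def levels(max_count, step_size):
    """All levels 0, ..., N_C of a species."""
    return list(range(max_level(max_count, step_size) + 1))


# Hypothetical species with at most 95 molecules and a step size of 10.
n_c = max_level(95, 10)
all_levels = levels(95, 10)
print(n_c, all_levels)
```

Here the species has maximum level 10 and therefore 11 levels, with the top level covering the partially filled last step.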

The operational semantics for Bio-PEPA systems with levels is given in Figure 1 where $N_S$ is the
maximum level for the species S. These operational semantics define two labelled transition systems. The 
enzyme, inhibitor and general modifier prefixes are used in reactions that are not modelled as bimolecular 
reactions with mass action kinetics, hence the semantics for these prefixes reflect the fact that the species 
is not consumed in the reaction and the level remains the same. 

The rules with lowercase letters derive a relation where transitions are labelled with a reaction name 
and a string collecting information about each species involved in the reaction consisting of the species 
name, its role in the reaction, its stoichiometric coefficient for the reaction, and its current level. This 
information is then used in the rule Final which includes the context of the model to determine the rate 
of the reaction and generates the stochastic relation$^4$ from which a CTMC can be obtained. In this paper,
we focus on the capability relation as we use this in our definition of equivalence. 

Definition 3.3. A capability label is defined as $\theta = (\alpha, w)$ with $\alpha \in \mathcal{A}$ and the list $w$ defined by the
grammar $w ::= [S : \mathrm{op}(l, \kappa)] \mid w :: w$ where $S \in \mathcal{S}$, $l \in \mathbb{N}$, $l \ge 0$ the level, $\kappa \in \mathbb{N}$, $\kappa \ge 1$ the stoichiometric
coefficient, and $::$ is list concatenation. The set of all such capability labels is $\Theta$.

Definition 3.4. Given a Bio-PEPA model, the capability relation $\rightarrow_c\ \subseteq \mathcal{C} \times \Theta \times \mathcal{C}$ is the smallest relation
defined by the first nine rules in Figure 1. An element of the transition system is written $P \xrightarrow{(\alpha, w)}_c P'$.

The string $w$ is defined as a list [14] but the order of elements is not important so it can be viewed as
a multiset. For well-defined systems, $w$ is a set [19] and we will treat it as such in the sequel.

2 For a presentation of Bio-PEPA with locations see [12].

3 This is reasonable since cells and other biological compartments have constrained volumes. 

4 For more details on the stochastic relation defined by Final, see [14, 13]. Briefly, $r_\alpha[w, \mathcal{N}, \mathcal{K}] = f_\alpha[w, \mathcal{N}, \mathcal{K}]/H \in
(0, \infty)$ where $f_\alpha$ is the functional rate for the reaction $\alpha$ from $\mathcal{F}$ and $H$ is the step size. $\mathcal{N}$ provides species information and
$\mathcal{K}$ provides constants.
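$$\text{prefixReac}\quad ((\alpha,\kappa)\downarrow S)(l) \xrightarrow{(\alpha,[S:\downarrow(l,\kappa)])}_c S(l-\kappa) \qquad \kappa \le l \le N_S$$

$$\text{prefixProd}\quad ((\alpha,\kappa)\uparrow S)(l) \xrightarrow{(\alpha,[S:\uparrow(l,\kappa)])}_c S(l+\kappa) \qquad 0 \le l \le N_S - \kappa$$

$$\text{prefixMod}\quad ((\alpha,\kappa)\ \mathrm{op}\ S)(l) \xrightarrow{(\alpha,[S:\mathrm{op}(l,\kappa)])}_c S(l) \qquad \kappa \le l \le N_S \text{ if } \mathrm{op} = \oplus, \quad 0 \le l \le N_S \text{ if } \mathrm{op} \in \{\ominus, \odot\}$$

$$\text{choice1}\quad \dfrac{S_1(l) \xrightarrow{(\alpha,w)}_c S_1'(l')}{(S_1+S_2)(l) \xrightarrow{(\alpha,w)}_c S_1'(l')} \qquad \text{choice2}\quad \dfrac{S_2(l) \xrightarrow{(\alpha,w)}_c S_2'(l')}{(S_1+S_2)(l) \xrightarrow{(\alpha,w)}_c S_2'(l')}$$

$$\text{coop1}\quad \dfrac{P_1 \xrightarrow{(\alpha,w)}_c P_1'}{P_1 \bowtie_L P_2 \xrightarrow{(\alpha,w)}_c P_1' \bowtie_L P_2}\ \ \alpha \notin L \qquad \text{coop2}\quad \dfrac{P_2 \xrightarrow{(\alpha,w)}_c P_2'}{P_1 \bowtie_L P_2 \xrightarrow{(\alpha,w)}_c P_1 \bowtie_L P_2'}\ \ \alpha \notin L$$

$$\text{coop3}\quad \dfrac{P_1 \xrightarrow{(\alpha,w_1)}_c P_1' \quad P_2 \xrightarrow{(\alpha,w_2)}_c P_2'}{P_1 \bowtie_L P_2 \xrightarrow{(\alpha,w_1::w_2)}_c P_1' \bowtie_L P_2'}\ \ \alpha \in L \qquad \text{constant}\quad \dfrac{S(l) \xrightarrow{(\alpha,[S:\mathrm{op}(l,\kappa)])}_c S'(l')}{C(l) \xrightarrow{(\alpha,[C:\mathrm{op}(l,\kappa)])}_c S'(l')}\ \ C \stackrel{\mathrm{def}}{=} S$$

$$\text{Final}\quad \dfrac{P \xrightarrow{(\alpha,w)}_c P'}{\langle \mathcal{V},\mathcal{N},\mathcal{K},\mathcal{F},Comp,P \rangle \xrightarrow{(\alpha,\, r_\alpha[w,\mathcal{N},\mathcal{K}])}_s \langle \mathcal{V},\mathcal{N},\mathcal{K},\mathcal{F},Comp,P' \rangle}$$

Figure 1: Operational semantics of Bio-PEPA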

Definition 3.5. The derivative set $ds(P)$ is the smallest set such that $P \in ds(P)$ and if $P' \in ds(P)$ and
$P' \xrightarrow{(\alpha, w)}_c P''$ then $P'' \in ds(P)$. $P' \in ds(P)$ is a derivative of $P$.
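The derivative set is the set of states reachable through the capability relation, so for a model with levels it can be computed by a standard graph search. The sketch below assumes a hypothetical `successors` function that returns the outgoing `(label, next_state)` pairs of a state; the toy species with one reactant and one product prefix and a maximum level of 3 is likewise an assumption for illustration.

```python
from collections import deque


def derivative_set(initial, successors):
    """Smallest set containing `initial` and closed under transitions."""
    seen = {initial}
    queue = deque([initial])
    while queue:
        state = queue.popleft()
        for _label, nxt in successors(state):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


N_MAX = 3  # hypothetical maximum level of the toy species


def toy_successors(level):
    """Toy species C = (a,1) reactant C + (b,1) product C, state = current level."""
    moves = []
    if level >= 1:            # reactant prefix: needs kappa <= l
        moves.append(("a", level - 1))
    if level + 1 <= N_MAX:    # product prefix: needs l <= N - kappa
        moves.append(("b", level + 1))
    return moves


print(sorted(derivative_set(0, toy_successors)))
```

Because the number of levels is bounded, the search terminates, which reflects the finiteness of the transition system for Bio-PEPA models with levels.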

This section has defined Bio-PEPA syntax and semantics for Bio-PEPA systems with levels. We 
assume well-defined Bio-PEPA systems with levels for the remainder of the paper. Moreover, we assume 
that in well-defined Bio-PEPA systems all shared actions are synchronised over and hence we use the 
aforementioned notation $\bowtie_*$ for cooperation. The next section considers the equivalence we develop.



4 Fast-slow bisimilarity 

Our basis for developing the equivalence is the QSSA where intermediate species at steady state can be
approximated. As defined in Section 2, we have a set of intermediate species $T$ and non-intermediate
species $\Phi$ with $T \cap \Phi = \emptyset$. When comparing models, we need to ensure that we exclude intermediate
species in the comparison. Therefore we define a function that transforms the set $w$ by removing all
intermediate species in $T$, and leaving certain species in $\Lambda \subseteq \Phi$. Typically, these species will be those
that appear in transitions for slow reactions in both models with the same role in both reactions. Let
$w_\Lambda = \{C : \mathrm{op}(l, \kappa) \in w \mid C \in \Lambda\}$ and note that $(w_1 :: w_2)_\Lambda = (w_1)_\Lambda :: (w_2)_\Lambda$.

Since QSSA is based on relative reaction rates, we assume that each reaction can be described as fast
or slow, leading to a partition of the set $\mathcal{A}$ into the set of slow reactions $\mathcal{A}_s$ and the set of fast reactions
$\mathcal{A}_f$. For convenience, we introduce new transitions.
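Definition 4.1. For well-defined Bio-PEPA models $P$, $P'$.

• If $P \xrightarrow{(\alpha, w)}_c P'$ and $\alpha \in \mathcal{A}_f$ then $P \twoheadrightarrow P'$. • If $P \xrightarrow{(\alpha, w)}_c P'$ and $\alpha \in \mathcal{A}_s$ then $P \xrightarrow{(\alpha, w_\Lambda)} P'$.

• If $P (\twoheadrightarrow)^* P'$ then $P \Rightarrow P'$. • If $P (\twoheadrightarrow)^* \xrightarrow{(\alpha, w_\Lambda)} (\twoheadrightarrow)^* P'$ then $P \xRightarrow{(\alpha, w_\Lambda)} P'$.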

These new transitions consider the reaction names, the reaction speeds and certain non-intermediate 
species involved in the reaction, hence the equivalence defined will be semi-quantitative in nature. We 
now define a new bisimulation based on fast and slow actions. 

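Definition 4.2. A symmetric relation $\mathcal{R}$ over $\mathcal{C} \times \mathcal{C}$ is a fast-slow bisimulation for $\mathcal{A}_f$ if $(P, Q) \in \mathcal{R}$
implies that

• for all $\alpha \in \mathcal{A}_s$ whenever $P \xrightarrow{(\alpha, w_\Lambda)} P'$ there exists $Q'$ with $Q \xRightarrow{(\alpha, w_\Lambda)} Q'$ and $(P', Q') \in \mathcal{R}$, and

• whenever $P \twoheadrightarrow P'$ there exists $Q'$ with $Q \Rightarrow Q'$ and $(P', Q') \in \mathcal{R}$.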

Here $\twoheadrightarrow$ plays a similar role to $\xrightarrow{\tau}$ in Milner's definition of weak bisimulation [29], and hence these
are similar notions. Some results for weak bisimulation may hold for fast-slow bisimulation but there
are limits on this, particularly for proofs based on transition derivations. For example, when showing
congruence where one works with transition derivations, the difference between $\twoheadrightarrow$ and $\xrightarrow{\tau}$ is apparent
- this will be discussed in more detail later when congruence with respect to cooperation is proved.
We can now define a notion of fast-slow bisimilarity with respect to a given set of fast actions. 

Definition 4.3. $P$ and $Q$ are fast-slow bisimilar for $\mathcal{A}_f$ ($P \approx_{\mathcal{A}_f} Q$) if there exists a fast-slow bisimulation
for $\mathcal{A}_f$, $\mathcal{R}$, such that $(P, Q) \in \mathcal{R}$.

Then $\approx_{\mathcal{A}_f}$ is the largest fast-slow bisimulation for $\mathcal{A}_f$. Now that we have a definition, we wish to
show that it is useful by proving congruence for operators of interest, and by considering an example. 
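For a finite transition system, the two clauses of Definition 4.2 can be checked mechanically. The following is a minimal sketch under stated assumptions: transitions are tuples `(source, action, w, target)`, each entry of `w` is a tuple `(species, role, level, kappa)`, and the candidate relation is closed under symmetry before checking. All names, the two toy models and their labels are hypothetical.

```python
def restrict(w, lam):
    """w_Lambda: keep only the entries of w whose species is in lam."""
    return frozenset(entry for entry in w if entry[0] in lam)


def fast_closure(state, trans, fast):
    """States reachable from `state` by zero or more fast transitions."""
    seen, stack = {state}, [state]
    while stack:
        cur = stack.pop()
        for src, act, _w, dst in trans:
            if src == cur and act in fast and dst not in seen:
                seen.add(dst)
                stack.append(dst)
    return seen


def weak_slow(state, trans, fast, lam):
    """Triples (action, w_Lambda, target) for fast* slow fast* paths."""
    moves = set()
    for mid in fast_closure(state, trans, fast):
        for src, act, w, dst in trans:
            if src == mid and act not in fast:
                for end in fast_closure(dst, trans, fast):
                    moves.add((act, restrict(w, lam), end))
    return moves


def is_fast_slow_bisimulation(rel, trans, fast, lam):
    """Check both clauses of the definition on the symmetric closure of rel."""
    sym = set(rel) | {(q, p) for p, q in rel}
    for p, q in sym:
        for src, act, w, dst in trans:
            if src != p:
                continue
            if act in fast:
                # fast step of p: q answers with zero or more fast steps
                ok = any((dst, q2) in sym
                         for q2 in fast_closure(q, trans, fast))
            else:
                # slow step of p: q answers with a weak slow step, same label
                label = restrict(w, lam)
                ok = any(a == act and l == label and (dst, q2) in sym
                         for a, l, q2 in weak_slow(q, trans, fast, lam))
            if not ok:
                return False
    return True


# Toy models: A binds (fast) and then makes product (slow); B only makes product.
trans = [
    ("A0", "bind", frozenset({("S", "reactant", 1, 1), ("SE", "product", 0, 1)}), "A1"),
    ("A1", "make", frozenset({("SE", "reactant", 1, 1), ("P", "product", 0, 1)}), "A2"),
    ("B0", "make", frozenset({("S", "reactant", 1, 1), ("P", "product", 0, 1)}), "B1"),
]
fast, lam = {"bind"}, {"P"}
good = {("A0", "B0"), ("A1", "B0"), ("A2", "B1")}
bad = {("A0", "B0"), ("A2", "B1")}
print(is_fast_slow_bisimulation(good, trans, fast, lam))
print(is_fast_slow_bisimulation(bad, trans, fast, lam))
```

The relation `good` passes because the fast `bind` step is matched by the other model staying where it is, whereas `bad` fails since the state reached by the fast step is not related to anything.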

5 Congruence 

When a semantic equivalence captures the notion of same behaviour we can reason about pairs of systems 
acting the same. However, if we show that a semantic equivalence is a congruence with respect to an 
operator of the process algebra, then we know that we can build new systems with equivalent behaviour 
using that operator. We start by considering the cooperation operator as it would be useful to know that 
we can combine fast-slow bisimilar systems. 

To ensure congruence, fast actions must not appear on both sides of the cooperation
operator. This makes sense since this equivalence abstracts away from the details of the fast reactions, 
and it is not possible to know if the two models have abstracted the same reactions. Moreover, recent 
assessment of the Michaelis-Menten approximation has shown that the QSSA does not hold if there are other
fast reactions (apart from those to which the QSSA is applied) involving the species that are part of the
Michaelis-Menten reactions [35].

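Theorem 5.1. If $P_1 \approx_{\mathcal{A}_f} P_2$ then $P_1 \bowtie_* Q \approx_{\mathcal{A}_f} P_2 \bowtie_* Q$ and $Q \bowtie_* P_1 \approx_{\mathcal{A}_f} Q \bowtie_* P_2$ provided that no action
in $\mathcal{A}_f$ appears in both $Q$ and $P_1$ or in both $Q$ and $P_2$.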

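Proof. Let $\mathcal{R} = \{(P_1' \bowtie_* Q', P_2' \bowtie_* Q') \mid P_1' \approx_{\mathcal{A}_f} P_2'\}$. Consider a transition $P_1' \bowtie_* Q' \xrightarrow{(\alpha, w_\Lambda)} R$ which is
obtained from $P_1' \bowtie_* Q' \xrightarrow{(\alpha, w)}_c R$ since $\alpha \in \mathcal{A}_s$. There are three cases and we prove the most complex
here. Assume $P_1' \xrightarrow{(\alpha, w_1)}_c P_1''$, $Q' \xrightarrow{(\alpha, w_2)}_c Q''$ and $w = w_1 :: w_2$, then $R$ is $P_1'' \bowtie_* Q''$. Since $P_1' \approx_{\mathcal{A}_f} P_2'$,
$P_2' \xRightarrow{(\alpha, (w_1)_\Lambda)} P_2''$ and hence $P_2' \xrightarrow{(\beta_1, v_1)}_c \cdots \xrightarrow{(\beta_n, v_n)}_c \hat{P}_2 \xrightarrow{(\alpha, w_1')}_c \tilde{P}_2 \xrightarrow{(\gamma_1, u_1)}_c \cdots \xrightarrow{(\gamma_m, u_m)}_c P_2''$, with $(w_1')_\Lambda = (w_1)_\Lambda$, for the actions
$\beta_1, \ldots, \beta_n, \gamma_1, \ldots, \gamma_m \in \mathcal{A}_f$. From this, we can derive $P_2' \bowtie_* Q' \xrightarrow{(\beta_1, v_1)}_c \cdots \xrightarrow{(\beta_n, v_n)}_c \hat{P}_2 \bowtie_* Q' \xrightarrow{(\alpha, w_1' :: w_2)}_c
\tilde{P}_2 \bowtie_* Q'' \xrightarrow{(\gamma_1, u_1)}_c \cdots \xrightarrow{(\gamma_m, u_m)}_c P_2'' \bowtie_* Q''$, hence we have $P_2' \bowtie_* Q' \xRightarrow{(\alpha, (w_1 :: w_2)_\Lambda)} P_2'' \bowtie_* Q''$ as required with
$(P_1'' \bowtie_* Q'', P_2'' \bowtie_* Q'') \in \mathcal{R}$.

Next, consider a transition $P_1' \bowtie_* Q' \twoheadrightarrow R$. Hence there exists $\beta \in \mathcal{A}_f$ such that $P_1' \bowtie_* Q' \xrightarrow{(\beta, v)}_c R$.
There are two cases since cooperation over an action in $\mathcal{A}_f$ is excluded by the condition. We show the
case where $Q' \xrightarrow{(\beta, v)}_c Q''$ and $R$ is $P_1' \bowtie_* Q''$. Then the transition $P_2' \bowtie_* Q' \xrightarrow{(\beta, v)}_c P_2' \bowtie_* Q''$ can be derived
and hence $P_2' \bowtie_* Q' \Rightarrow P_2' \bowtie_* Q''$ with $(P_1' \bowtie_* Q'', P_2' \bowtie_* Q'') \in \mathcal{R}$ as required. □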

To see why the sharing of fast actions must be prohibited, consider $S_1 \stackrel{\mathrm{def}}{=} (\alpha, 2) \uparrow S_1 + (\gamma, 2) \downarrow S_1$ and
$S_2 \stackrel{\mathrm{def}}{=} (\beta, 2) \uparrow S_2 + (\gamma, 2) \downarrow S_2$. Clearly $S_1(0) \approx_{\{\alpha,\beta\}} S_2(0)$. Considering a third species $S \stackrel{\mathrm{def}}{=} (\alpha, 1) \downarrow S$,
it is not the case that $S_1(0) \bowtie_* S(l) \approx_{\{\alpha,\beta\}} S_2(0) \bowtie_* S(l)$ since these systems have very different
behaviours because there are no $\gamma$ reactions in the first system and there are repeating $\gamma$ reactions in the
second. To prevent this difference, $S$ could be modified to require that it perform all fast reactions giving
$S \stackrel{\mathrm{def}}{=} (\alpha, 1) \downarrow S + (\beta, 1) \downarrow S$ but it is not clear how to generalise this beyond species. This counter-example
for the condition in the theorem demonstrates that the definition of fast-slow bisimulation does differ
from that of weak bisimulation. Here we abstract away from fast reaction names on a transition, whereas
with weak bisimulation, transitions with the named action $\tau$ are treated abstractly.

5.1 The species extension operator 

Since the focus is well-defined Bio-PEPA models, it is not useful to consider the operators for sequential 
agents individually. However, we do sometimes want to extend species' ability to be involved in
reactions, and we now define an operator that permits this to happen and show that we have congruence for
this operator under certain conditions. 

When we build the models of biological systems, we may have no knowledge of what other reactions 
the species could be involved in. For example, considering the product of some sequence of reactions, we 
can imagine various scenarios in which we would want the product to be able to react with new species 
added by cooperation. It is difficult to do this at model level but we can consider it at species level by 
defining an appropriate extension operator [19].

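Definition 5.1. Given two well-defined species $A$ and $B$ such that the reaction names of $A$ are disjoint
from those of $B$, define $A\{B\}$, the extension of $A$ by $B$, as

$$A\{B\} \stackrel{\mathrm{def}}{=} \sum_{i=1}^{n} (\alpha_i, \kappa_i)\, \mathrm{op}_i . A\{B\} + \sum_{j=1}^{m} (\beta_j, \lambda_j)\, \mathrm{op}_j . A\{B\} \quad \text{where} \quad A \stackrel{\mathrm{def}}{=} \sum_{i=1}^{n} (\alpha_i, \kappa_i)\, \mathrm{op}_i . A, \quad B \stackrel{\mathrm{def}}{=} \sum_{j=1}^{m} (\beta_j, \lambda_j)\, \mathrm{op}_j . B.$$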

This permits $A$ to take on additional reaction capabilities, specifically those of $B$. $A\{B\}$ is
well-defined since there are no repeated reaction names and because $A$ and $B$ are well-defined. Note that $A\{B\}$
and B{A} are isomorphic since their transition systems are structurally identical with matching actions. 
The next theorem shows how species can be augmented with ways to participate in new reactions in a 
way which preserves fast-slow bisimulation. 

Theorem 5.2. Let $C_1(l) \approx_{\mathcal{A}_f} C_2(l)$ for sequential Bio-PEPA components $C_1$ and $C_2$ for all $0 \le l \le N_{C_1} =
N_{C_2}$ and let $C$ be a well-defined species with reaction names disjoint from those in $C_1$ and $C_2$. Then
$C_1\{C\}(l) \approx_{\mathcal{A}_f} C_2\{C\}(l)$ and $C\{C_1\}(l) \approx_{\mathcal{A}_f} C\{C_2\}(l)$.

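Proof. Consider a transition $C_1\{C\}(l) \xrightarrow{(\alpha, w_\Lambda)} C_1\{C\}(l')$ for $\alpha \in \mathcal{A}_s$, then $C_1\{C\}(l) \xrightarrow{(\alpha, w)}_c C_1\{C\}(l')$. If
$\alpha$ appears in $C$, then $C_2\{C\}(l) \xrightarrow{(\alpha, w)}_c C_2\{C\}(l')$ and $C_2\{C\}(l) \xRightarrow{(\alpha, w_\Lambda)} C_2\{C\}(l')$.

If $\alpha$ appears in $C_1$ and $C_2$ then $C_1(l) \xrightarrow{(\alpha, w_\Lambda)} C_1(l')$ and since $C_1(l) \approx_{\mathcal{A}_f} C_2(l)$, then $C_2(l) \xRightarrow{(\alpha, w_\Lambda)} C_2(l'')$.
This then gives $C_2(l) \xrightarrow{(\beta_1, v_1)}_c \cdots \xrightarrow{(\beta_n, v_n)}_c C_2(m) \xrightarrow{(\alpha, w')}_c C_2(m') \xrightarrow{(\gamma_1, u_1)}_c \cdots \xrightarrow{(\gamma_m, u_m)}_c C_2(l'')$ for the
actions $\beta_1, \ldots, \beta_n, \gamma_1, \ldots, \gamma_m \in \mathcal{A}_f$. $C_2\{C\}(l)$ can perform the same actions hence $C_2\{C\}(l) \xRightarrow{(\alpha, w_\Lambda)} C_2\{C\}(l'')$.

Next, consider $C_1\{C\}(l) \twoheadrightarrow C_1\{C\}(l')$. There exists $\beta \in \mathcal{A}_f$ such that $C_1\{C\}(l) \xrightarrow{(\beta, v)}_c C_1\{C\}(l')$.
If $\beta$ appears in $C$ then as above, there is a matching transition in $C_2\{C\}(l)$. If $\beta$ appears in $C_1$, since
$C_1(l) \approx_{\mathcal{A}_f} C_2(l)$, then $C_2(l) \Rightarrow C_2(l'')$ which gives $C_2(l) \xrightarrow{(\beta_1, v_1)}_c \cdots \xrightarrow{(\beta_n, v_n)}_c C_2(l'')$ for $\beta_1, \ldots, \beta_n \in \mathcal{A}_f$.
$C_2\{C\}(l)$ can perform these actions, hence $C_2\{C\}(l) \Rightarrow C_2\{C\}(l'')$ as required. □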

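Let $C_1 \stackrel{\mathrm{def}}{=} (\alpha, 1) \downarrow C_1$ and $C_2 \stackrel{\mathrm{def}}{=} (\alpha, 1) \downarrow C_2 + (\beta, 1) \odot C_2$ with shared maximum level, then $C_1(l) \approx_{\mathcal{A}_f}
C_2(l)$. For any sequential component $C$ with different reaction names to $\alpha$ and $\beta$ and the same maximum
level, the congruence result applies.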
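The extension operator of Definition 5.1 amounts to taking the union of two sets of prefixes after checking that their reaction names are disjoint. The sketch below makes that concrete under an assumed encoding in which a species is a dictionary from reaction name to a `(stoichiometry, role)` pair; the function name, the encoding and the example species are all hypothetical.

```python
def extend(a, b):
    """A{B}: the prefixes of A together with those of B.

    Raises ValueError when the reaction names are not disjoint, since the
    result would then repeat an action and not be well-defined.
    """
    shared = set(a) & set(b)
    if shared:
        raise ValueError(f"reaction names must be disjoint, shared: {sorted(shared)}")
    return {**a, **b}


# Hypothetical species: A is a reactant in alpha, B is a product of beta.
A = {"alpha": (1, "reactant")}
B = {"beta": (2, "product")}
AB = extend(A, B)
print(AB)
```

In this encoding `extend(A, B)` and `extend(B, A)` yield equal dictionaries, which mirrors the observation that $A\{B\}$ and $B\{A\}$ are isomorphic.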

6 Slow bisimilarity 

Checking for fast-slow bisimilarity requires that all reactions must be considered. We now define an 
equivalence over just the slow reactions. If we can identify conditions under which this equivalence 
implies fast-slow bisimilarity, then whenever we have models that satisfy those conditions, we need only 
check the slow reactions to prove that a relation is a fast-slow bisimulation. This section introduces such 
an equivalence and conditions. In the next section when we consider competitive inhibition, we will 
illustrate how compact our proofs are. First we define the new equivalence. As before, it is assumed that 
$\mathcal{A}_s$ and $\mathcal{A}_f$ partition $\mathcal{A}$.

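Definition 6.1. A symmetric relation $\mathcal{R}$ over $\mathcal{C} \times \mathcal{C}$ is a slow bisimulation for $\mathcal{A}_s$ if $(P, Q) \in \mathcal{R}$ implies
that for all $\alpha \in \mathcal{A}_s$ whenever

• $P \xrightarrow{(\alpha, w_\Lambda)} P'$ there exists $Q'$ with $Q \xRightarrow{(\alpha, w_\Lambda)} Q'$ and $(P', Q') \in \mathcal{R}$.

$P$ and $Q$ are slow bisimilar for $\mathcal{A}_s$ if there exists a slow bisimulation for $\mathcal{A}_s$, $\mathcal{R}$, such that $(P, Q) \in \mathcal{R}$.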

We now consider when this can be applied using a technique that allows variables to be classified by
what reactions affect them. 

6.1 Variable classification 

We present an existing technique that allows the identification of slow variables (those that are only 
affected by slow reactions) and fast variables (those that are affected by fast and slow reactions) [22].
Note that variables are not the same as species since a variable can either be an individual species or a
linear combination of species.

A set of reactions can be expressed as a stoichiometry matrix S which has m columns, one for each 
reaction and $n$ rows, one for each species. $S_{ij}$ describes the stoichiometry of species $X_i$ with respect to
reaction $R_j$.
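As an illustration, the stoichiometry matrix of the Michaelis-Menten reactions from Section 2 can be assembled as follows. This is a sketch only: the encoding of a reaction as a dictionary of net changes per species and the function name are assumptions.

```python
def stoichiometry_matrix(species, reactions):
    """Rows follow `species`, columns follow `reactions`.

    Each reaction is a dict mapping a species name to its net change;
    species that a reaction does not change get entry 0.
    """
    return [[rxn.get(sp, 0) for rxn in reactions] for sp in species]


species = ["S", "E", "SE", "P"]
reactions = [
    {"S": -1, "E": -1, "SE": +1},   # R1:  S + E -> SE
    {"S": +1, "E": +1, "SE": -1},   # R-1: SE -> S + E
    {"SE": -1, "P": +1, "E": +1},   # R2:  SE -> P + E
]
matrix = stoichiometry_matrix(species, reactions)
for name, row in zip(species, matrix):
    print(name, row)
```

Each row then records how one species changes across $R_1$, $R_{-1}$ and $R_2$; for instance the row for $P$ is non-zero only in the column of $R_2$.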

A stoichiometry matrix can be transformed into a matrix of the same size with the form described 
in Figure 2 [22]. As mentioned above, the variables that are associated with the rows of Q are linear
combinations of the original species variables and hence when given values for the new variables, it is 
possible to establish values for the species. 

The top row of submatrices in the figure are zeroes since these represent conserved variables and
reactions do not affect them. The submatrix of size $n_s \times m_s$ in the middle row captures the stoichiometry of slow reactions for slow
variables. The other submatrix in its row is zero since slow variables are not affected by fast reactions.
The last row of $Q$ consists of an $n_f \times m_s$ matrix and an $n_f \times m_f$ matrix describing the stoichiometry of
slow and fast reactions with respect to fast variables.
• Columns represent reactions $R_1, \ldots, R_{m_s}, R'_1, \ldots, R'_{m_f}$ where the $R_j$ are $m_s$ slow reactions,
the $R'_j$ are $m_f$ fast reactions, and $m_s + m_f = m$

• Rows represent variables $X_1, \ldots, X_{n_c}, X'_1, \ldots, X'_{n_s}, X''_1, \ldots, X''_{n_f}$ where the $X_i$ are conserved
variables, the $X'_i$ are slow variables, the $X''_i$ are fast variables and $n_c + n_s + n_f = n$

Figure 2: Matrix transformation [22].



For a given ordering of reactions and variables, Q is unique. However, for different orderings, an 
equally valid but different Q may be obtained. If there are no slow variables then this technique cannot 
be used. 

For reasons of space, it is not possible to fully describe the matrix transformation defined in [22]. The
idea is based around invariants. These are variables whose values are not changed by the dynamics of the 
model. First, invariants (conserved variables) of the model are identified. Then, by removing the slow 
reactions from the reaction equations, it is possible to find slow variables (invariants when slow reactions 
are removed), if any. Then sufficient fast variables must be identified so that there are n variables in total. 
Each new variable must be linearly independent of the other new variables, and the new variables are 
linear combinations of the original species variables. This process is illustrated in the example section. 
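The classification of a candidate variable can be checked directly from the stoichiometry matrix: a linear combination of species is unaffected by a reaction exactly when its coefficients, applied to that reaction's column, sum to zero. The sketch below applies this to the Michaelis-Menten reactions of Section 2 with $R_1$ and $R_{-1}$ fast and $R_2$ slow. It only classifies variables that are supplied to it and does not check linear independence or search for variables, both of which the full technique requires; all names are assumptions.

```python
def change(coeffs, column):
    """Net change of the variable sum(coeffs[i] * X_i) caused by one reaction."""
    return sum(c * x for c, x in zip(coeffs, column))


def classify(coeffs, matrix, slow_cols, fast_cols):
    """Return 'conserved', 'slow' or 'fast' for a linear combination of species."""
    columns = list(zip(*matrix))
    hit_by_slow = any(change(coeffs, columns[j]) != 0 for j in slow_cols)
    hit_by_fast = any(change(coeffs, columns[j]) != 0 for j in fast_cols)
    if not hit_by_slow and not hit_by_fast:
        return "conserved"      # an invariant of the whole model
    if not hit_by_fast:
        return "slow"           # changed by slow reactions only
    return "fast"               # changed by at least one fast reaction


# Rows S, E, SE, P; columns R1, R-1 (fast) and R2 (slow).
matrix = [
    [-1, 1, 0],
    [-1, 1, 1],
    [1, -1, -1],
    [0, 0, 1],
]
fast_cols, slow_cols = [0, 1], [2]

print(classify((0, 1, 1, 0), matrix, slow_cols, fast_cols))  # E + SE
print(classify((1, 0, 1, 1), matrix, slow_cols, fast_cols))  # S + SE + P
print(classify((0, 0, 0, 1), matrix, slow_cols, fast_cols))  # P
print(classify((0, 0, 1, 0), matrix, slow_cols, fast_cols))  # SE
```

Under this choice of fast and slow reactions, $E + SE$ and $S + SE + P$ come out as conserved, $P$ as a slow variable that is an individual species, and $SE$ as a fast variable.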

6.2 Application to Bio-PEPA 

Given a Bio-PEPA model, we can construct its stoichiometry matrix from the species component
definitions. Using the technique described above we can identify invariants, slow and fast variables$^5$.

Note that a well-defined Bio-PEPA model only differs from its derivatives in terms of the levels of 
each species, hence models can be represented as vectors where each element represents the level of a 
species. See Figure 3 for an example.

A model's transition system, the capability relation, is then defined over states that are given in 
vector form $(v_1, \ldots, v_p)$ for $p$ species. These states can be transformed to $(s_1, \ldots, s_{n_s}, f_1, \ldots, f_{n_f})$ where
$n_s + n_f = p - n_c$, producing a new transition system where the transitions are unchanged and the states are
defined with respect to the values of the new variables, specifically the slow and fast variables. Conserved 
variable values are not included in the new states since their values are fixed. The new states contain the 
same information as the original states, and they therefore stay unique. It is not possible for two states in 
the original transition system to collapse into one state in the new transition system. We can conclude that 
the transition systems are isomorphic, meaning that there is a bijection between states, and transitions 
are preserved with the same labels. 

We now identify conditions that allow us to show when a slow bisimulation is a fast-slow bisimulation 
using variable classification. We restrict ourselves to the case of equivalence between a model which has 
conserved, slow and fast variables and a model that has conserved variables, slow variables which are the 
same as those in the first model, and no fast variables. We also require that slow variables are individual 
species. Extending the result to more general cases is further work. 

5 These invariants are related to P-invariants in Petri nets [15]. Invariants can be determined automatically by the Bio-PEPA
Plug-in [16]. Since the Bio-PEPA Plug-in allows reactions to be removed when inferring invariants, slow variables can also be
found automatically by removing slow reactions. See also www.biopepa.org
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Proposition 6.1. Consider two Bio-PEPA models Pi for i= 1,2 where A,- contains exactly the slow species 
of the model, such that Ai amc/ A2 /zave ?/ze same species. Let M he a relation over Bio-PEPA models 
such that for all ((ji, .. . . .. ,/»,), (jj, ■• • , s 'n s )) £ ^> ^ e s i cmds\ are values for all slow variables 

and the f are the values for fast variables If Si = s\for i G {1, . . . n s } and M is a slow bisimulation for 
£/ s , then S? is a fast-slow bisimulation for g/f. 

Proof. Let $\mathcal{R}$ be a slow bisimulation with the required condition. Hence we need to consider fast
actions only. Let $((s_1,\ldots,s_{n_s},f_1,\ldots,f_{n_f}),(s_1,\ldots,s_{n_s})) \in \mathcal{R}$ and consider a fast transition
$(s_1,\ldots,s_{n_s},f_1,\ldots,f_{n_f}) \rightarrow (s_1,\ldots,s_{n_s},f'_1,\ldots,f'_{n_f})$. We know that $(s_1,\ldots,s_{n_s}) \Rightarrow (s_1,\ldots,s_{n_s})$ and also that
$((s_1,\ldots,s_{n_s},f'_1,\ldots,f'_{n_f}),(s_1,\ldots,s_{n_s})) \in \mathcal{R}$. There are no fast actions from $(s_1,\ldots,s_{n_s})$ to consider. □

Given two Bio-PEPA models, the general technique can be summarised as follows. 

1. Classify the variables in each model. Check that one model only has slow variables and that the
slow variables are species and the same between models. If not, try different variable orderings. 

2. Transform the transition systems of both models as described above. 

3. Define a relation over the transformed transition systems of the two models where slow variables
have the same value in each pair in the relation. 

4. Check that this relation is a slow bisimulation, and use Proposition 6.1 to show that it is a
fast-slow bisimulation. 

5. Since the transformation has provided an isomorphic transition system, the original models are 
fast-slow bisimilar. 

7 Competitive inhibition 

We now consider an example where there are significantly different rates and hence a suitable test case 
for fast-slow bisimulation. It is an example of competitive inhibition [36] where an inhibitor is introduced,
giving the reactions $S + EI \longleftrightarrow S + E + I \longleftrightarrow SE + I \longrightarrow P + E + I$. Here, the first reversible
reaction describes how the enzyme and inhibitor can bind together to form a compound. The second 
reversible reaction shows how the substrate and enzyme bind together to form a compound from which 
the product can be obtained. The binding of the inhibitor and enzyme competes with the binding of the 
enzyme and substrate since when the enzyme is bound to the inhibitor it is not available for the reaction 
with the substrate and hence reduces the amount of product that can be produced. Then $\mathcal{S} = \{S,E,I\}$,
$\mathcal{P} = \{P,E,I\}$ and $\mathcal{T} = \{EI,SE\}$ since EI and SE are the intermediate species (as defined in Section 0)
created by these reactions. Because of the explicit representations of the inhibitor and enzyme, and their
associated intermediates, we choose to model the basic bimolecular reactions with mass action kinetics.

These reactions can be expressed in Bio-PEPA as follows, where $\alpha_1$ and $\alpha_{-1}$ are the reactions
involving enzyme and inhibitor, $\beta_1$ and $\beta_{-1}$ are the reactions involving substrate and enzyme, and $\gamma$ is the
reaction that produces the product.

$S \stackrel{\text{def}}{=} (\beta_1,1){\downarrow}S + (\beta_{-1},1){\uparrow}S$
$P \stackrel{\text{def}}{=} (\gamma,1){\uparrow}P$
$I \stackrel{\text{def}}{=} (\alpha_1,1){\uparrow}I + (\alpha_{-1},1){\downarrow}I$
$EI \stackrel{\text{def}}{=} (\alpha_1,1){\downarrow}EI + (\alpha_{-1},1){\uparrow}EI$
$SE \stackrel{\text{def}}{=} (\beta_1,1){\uparrow}SE + (\beta_{-1},1){\downarrow}SE + (\gamma,1){\downarrow}SE$
$E \stackrel{\text{def}}{=} (\alpha_1,1){\uparrow}E + (\alpha_{-1},1){\downarrow}E + (\beta_1,1){\downarrow}E + (\beta_{-1},1){\uparrow}E + (\gamma,1){\uparrow}E$

$Sys \stackrel{\text{def}}{=} S(l_S) \bowtie_{*} E(l_E) \bowtie_{*} I(l_I) \bowtie_{*} P(l_P) \bowtie_{*} EI(l_{EI}) \bowtie_{*} SE(l_{SE})$

Here, based on biological understanding, we set $\{\alpha_1,\alpha_{-1},\beta_1,\beta_{-1}\} = \mathcal{A}_f$, namely that these are the fast
reactions, and that $\gamma \in \mathcal{A}_s$.

States of Sys (linked left to right by $\beta_1$ / $\beta_{-1}$)                      State of Sys'
(5,3,0,0,0,0) <-> (4,2,0,0,0,1) <-> (3,1,0,0,0,2) <-> (2,0,0,0,0,3)      (5,3,0,0)
(4,3,0,1,0,0) <-> (3,2,0,1,0,1) <-> (2,1,0,1,0,2) <-> (1,0,0,1,0,3)      (4,3,0,1)
(3,3,0,2,0,0) <-> (2,2,0,2,0,1) <-> (1,1,0,2,0,2) <-> (0,0,0,2,0,3)      (3,3,0,2)
(2,3,0,3,0,0) <-> (1,2,0,3,0,1) <-> (0,1,0,3,0,2)                        (2,3,0,3)
(1,3,0,4,0,0) <-> (0,2,0,4,0,1)                                          (1,3,0,4)
(0,3,0,5,0,0)                                                            (0,3,0,5)

Each $\gamma$ transition leads from a state to a state in the row below.

Figure 3: Transition system for Sys and Sys' for n = 5 and m = 3 with no inhibitor present (only reaction
names appear on transitions).



We wish to show that this is fast-slow bisimilar to the simpler Bio-PEPA model defined as follows. 
In this model, only a single reaction is modelled and this reaction has a rate which takes into account the 
amount of enzyme and inhibitor present. This reaction is considered to be a slow reaction. Since this 
reaction is not based on mass action kinetics, the inhibitor and enzyme prefix operators are used. The
reaction is named $\gamma$ as it produces P, as in the previous model. The use of primes on species and model
names is a syntactic convenience to distinguish different species and model components. However, later 
on we will view P and P' as the same when we consider w\. 

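$S' \stackrel{\text{def}}{=} (\gamma,1){\downarrow}S'$    $E' \stackrel{\text{def}}{=} (\gamma,1){\oplus}E'$    $I' \stackrel{\text{def}}{=} (\gamma,1){\ominus}I'$    $P' \stackrel{\text{def}}{=} (\gamma,1){\uparrow}P'$

$Sys' \stackrel{\text{def}}{=} S'(l_{S'}) \bowtie_{*} E'(l_{E'}) \bowtie_{*} I'(l_{I'}) \bowtie_{*} P'(l_{P'})$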

From the equations, it is clear that for a starting level of the substrate S (or S') given by $l_S = n$ it is
not possible to reach more than n levels of P (alternatively P') if the starting level of P is set to $l_P = 0$.
This agrees with the biological understanding that these reactions represent a transformation of S to P 
through a number of steps. 

We assume that neither the intermediates nor the product are present at the start in the more complex
model, as is standard [37]. Hence, for a starting level of m of the enzyme, it is not possible to have more
than m levels of the substrate-enzyme compound, and for a starting level of m of the enzyme and p of the 
inhibitor, it is not possible to have more than min{m,p} levels of the enzyme-inhibitor compound. 

As mentioned above, a well-defined Bio-PEPA model only differs from its derivatives in the levels
of the species, and models and derivatives can be expressed in numeric vector form. For example, for Sys
the vector (2,0,3,1,0,4) describes the model with 2 levels of S, none of E, 3 of I, 1 of the product P,
none of the compound EI and 4 of the compound SE. 
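
The vector form lends itself to a direct enumeration of the transition system. The following Python sketch is an illustration rather than the algorithm of any Bio-PEPA tool. It assumes the species ordering (S, E, I, P, EI, SE), treats a reaction as enabled whenever each of its reactants has at least one level, and assumes maximum levels are large enough never to block a reaction. The names `REACTIONS`, `successors` and `reachable` are hypothetical.

```python
from collections import deque

# Change in each species level, in the order (S, E, I, P, EI, SE).
REACTIONS = {
    "alpha1": (0, 1, 1, 0, -1, 0),    # EI -> E + I   (fast)
    "alpha-1": (0, -1, -1, 0, 1, 0),  # E + I -> EI   (fast)
    "beta1": (-1, -1, 0, 0, 0, 1),    # S + E -> SE   (fast)
    "beta-1": (1, 1, 0, 0, 0, -1),    # SE -> S + E   (fast)
    "gamma": (0, 1, 0, 1, 0, -1),     # SE -> P + E   (slow)
}


def successors(state):
    """Yield (reaction name, next state) for every enabled reaction."""
    for name, change in REACTIONS.items():
        nxt = tuple(x + d for x, d in zip(state, change))
        if min(nxt) >= 0:  # no level may become negative
            yield name, nxt


def reachable(start):
    """Breadth-first exploration of all states reachable from `start`."""
    seen, queue = {start}, deque([start])
    while queue:
        for _, nxt in successors(queue.popleft()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen
```

Under these assumptions, `reachable((5, 3, 0, 0, 0, 0))` yields the 18 states of Sys for n = 5, m = 3 and no inhibitor.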

7.1 Constructing the bisimulation 

Under the initial species levels described above, there are four cases of interest: only substrate present, 
substrate and enzyme present, substrate and inhibitor present, and substrate, enzyme and inhibitor present. 
These can be considered in one relation over the two models with starting vectors $(n,m,p,0,0,0)$ (using
the ordering $(l_S,l_E,l_I,l_P,l_{EI},l_{SE})$) and $(n,m,p,0)$ (using the ordering $(l_{S'},l_{E'},l_{I'},l_{P'})$) with $n > 0$ and
$m,p \geq 0$. Figure 3 illustrates the case when $n = 5$, $m = 3$ and $p = 0$. This case with no inhibitor represents
an instance of the standard Michaelis-Menten mechanism [36] as discussed in Section 2.

Sys: new variables
  $x_{S_t} = S + SE + P = S_0 = n$      conserved
  $x_{E_t} = E + EI + SE = E_0 = m$     conserved
  $x_{I_t} = EI + I = I_0 = p$          conserved
  $x_P = P = k$                         slow
  $x_{EI} = EI = l$                     fast
  $x_{SE} = SE = j$                     fast
  new state: (P, EI, SE)

Sys': new variables
  $x_{S'_t} = S' + P' = S'_0 = n$       conserved
  $x_{E'} = E' = E'_0 = m$              conserved
  $x_{I'} = I' = I'_0 = p$              conserved
  $x_{P'} = P' = k$                     slow
  new state: (P')

Figure 4: Identification of conserved, fast and slow variables




To define fast-slow bisimulation, we must determine which non-intermediate species are in the set A. 
The label on the transition of a $\gamma$ reaction in Sys is $(\gamma, w)$ where $w = \{P{:}\uparrow(1,i_1), SE{:}\downarrow(1,i_2), E{:}\uparrow(1,i_3)\}$.
For a $\gamma$ in Sys', the set is $\{P'{:}\uparrow(1,j_1), S'{:}\downarrow(1,j_2), E'{:}\oplus(1,j_3), I'{:}\ominus(1,j_4)\}$. We only want to compare
species that appear in both sets and that have the same role, hence we let A = {P} = {P'}. We will show 
below that the product is also the slow variable of both systems, illustrating another way to determine A. 

Next define the relation $\mathcal{R}$ as
$\{((n-(k+j),\, m-(j+l),\, p-l,\, k,\, l,\, j),\ (n-k,\, m,\, p,\, k)) \mid 0 \leq k \leq n,\ 0 \leq j \leq \min\{m,n-k\},\ 0 \leq l \leq p,\ j+l \leq m\}$.
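
As a worked instance, the relation can be enumerated directly from its definition. This is a minimal Python sketch; the function name `relation` is an assumption, and the bound `min(p, m - j)` on l simply combines the two constraints on l into one loop range.

```python
def relation(n, m, p):
    """Enumerate the pairs of the relation for starting levels n, m and p.

    Left components use the order (S, E, I, P, EI, SE) and right components
    use (S', E', I', P'); k is the product level, l the level of EI and
    j the level of SE.
    """
    pairs = set()
    for k in range(n + 1):
        for j in range(min(m, n - k) + 1):
            for l in range(min(p, m - j) + 1):  # 0 <= l <= p and j + l <= m
                big = (n - (k + j), m - (j + l), p - l, k, l, j)
                small = (n - k, m, p, k)
                pairs.add((big, small))
    return pairs
```

For n = 5, m = 3 and p = 0 this produces 18 pairs, one for each state of Sys, and in every pair the two states agree on the product level.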

This captures the idea suggested by Figure 3 that states with the same level of product are those that
should be paired in $\mathcal{R}$. We now show that $\mathcal{R}$ is a fast-slow bisimulation for $\mathcal{A}_f$ using the approach
given in the previous section. Figure 4 provides the new variables for each model. Here, variables
with subscript 0 indicate initial values for those species. First three invariants are identified, then we
consider just the fast transitions and this allows us to determine which species are not affected by the
fast transitions. P is not affected and neither is S + SE. Since these are not linearly independent (due to
the first invariant), we need to choose one of them, and we choose P since it is a single species. There
are no other linearly independent slow variables so we need to find two fast variables that are linearly
independent from each other and the four defined variables. EI and SE are suitable candidates. The
technique can also be applied to the variables in Sys' where there are no fast variables since the only
reaction $\gamma$ is slow.
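
The classification of variables for Sys can be checked mechanically from the stoichiometry of the reactions. The Python sketch below is illustrative only and is not the procedure used by the Bio-PEPA Plug-in. It assumes the species ordering (S, E, I, P, EI, SE), and the names `REACTIONS`, `FAST` and `classify` are hypothetical. It does not check linear independence, which must still be considered when choosing the final set of variables.

```python
# Change in each species level, in the order (S, E, I, P, EI, SE).
REACTIONS = {
    "alpha1": (0, 1, 1, 0, -1, 0),
    "alpha-1": (0, -1, -1, 0, 1, 0),
    "beta1": (-1, -1, 0, 0, 0, 1),
    "beta-1": (1, 1, 0, 0, 0, -1),
    "gamma": (0, 1, 0, 1, 0, -1),
}
FAST = ("alpha1", "alpha-1", "beta1", "beta-1")


def dot(coeffs, change):
    return sum(c * d for c, d in zip(coeffs, change))


def classify(coeffs):
    """Classify a linear combination of species levels.

    conserved: unchanged by every reaction
    slow:      unchanged by every fast reaction, changed by some slow one
    fast:      changed by at least one fast reaction
    """
    if all(dot(coeffs, change) == 0 for change in REACTIONS.values()):
        return "conserved"
    if all(dot(coeffs, REACTIONS[name]) == 0 for name in FAST):
        return "slow"
    return "fast"
```

For example, `classify((1, 0, 0, 1, 0, 1))` confirms that S + SE + P is conserved, while both P and S + SE are reported as slow, matching the discussion above.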

Hence the states of the transition systems can be transformed without changing the labels on the 
transitions. The new transition systems have the same form as the original transition systems, but the 
new states are vectors with the first three elements of the original vector removed. A new relation 
can be defined over these new transition systems that preserves the relationship between states. Let 
$\mathcal{R}' = \{((k,l,j),(k)) \mid 0 \leq k \leq n,\ 0 \leq j \leq \min\{m,n-k\},\ 0 \leq l \leq p,\ j+l \leq m\}$.

Since $\mathcal{R}'$ has the form required for the application of Proposition 6.1 and the two models have the
same slow variables, if $\mathcal{R}'$ is a slow bisimulation for $\mathcal{A}_s$, then it is a fast-slow bisimulation for $\mathcal{A}_f$. The
new transition system is isomorphic to the original transition system and the relationship between states
is preserved by $\mathcal{R}'$, hence $\mathcal{R}$ is also a fast-slow bisimulation for $\mathcal{A}_f$ over the original transition system.

We now proceed with the proof that $\mathcal{R}'$ is a slow bisimulation for $\mathcal{A}_s$. For notational convenience,
we let $(P)_i = \{P{:}\uparrow(1,i)\}$, and consider in turn the different cases for which there are $\gamma$ transitions.

• Consider $((k,l,j),(k)) \in \mathcal{R}'$ for $0 \leq k < n$, $0 \leq l \leq p$, $0 < j \leq \min\{m,n-k\}$ which represent
states where some substrate-enzyme compound is available. Then $(k) \xrightarrow{(\gamma,(P)_k)} (k+1)$ is matched by
$(k,l,j) \xrightarrow{(\gamma,(P)_k)} (k+1,l,j-1)$ and vice versa.

• Consider $((k,l,0),(k)) \in \mathcal{R}'$ for $0 \leq k < n$, $0 \leq l \leq p$ when no substrate-enzyme compound is
present. There are three cases depending on the relationship of m and p.

  • If $m > p$, consider $0 \leq l \leq p$. Since m is greater than p, whatever l is, there will be additional
  enzyme to form SE and $(k) \xrightarrow{(\gamma,(P)_k)} (k+1)$ is matched by $(k,l,0) \twoheadrightarrow (k,l,1) \xrightarrow{(\gamma,(P)_k)} (k+1,l,0)$.

  • If $m \leq p$ and $0 \leq l \leq m-1$, then enzyme is available and the previous case applies.

  • If $m \leq p$ and $l = m$, then all enzyme is bound in EI. Then $(k) \xrightarrow{(\gamma,(P)_k)} (k+1)$ is matched by
  $(k,m,0) \twoheadrightarrow (k,m-1,0) \twoheadrightarrow (k,m-1,1) \xrightarrow{(\gamma,(P)_k)} (k+1,m-1,0)$.
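
The case analysis can be cross-checked by brute force for particular values of n, m and p. The following Python sketch works on the untransformed states and rests on several assumptions that are not part of the original development: the species ordering (S, E, I, P, EI, SE), a reaction being enabled whenever each reactant has at least one level, m being at least 1, and the $\gamma$ reaction of Sys' being enabled exactly when substrate remains. The names `fast_closure`, `can_match_gamma` and `check` are hypothetical.

```python
from collections import deque

# Changes for the four fast reactions and for gamma, order (S, E, I, P, EI, SE).
FAST = [
    (0, 1, 1, 0, -1, 0),   # alpha1:  EI -> E + I
    (0, -1, -1, 0, 1, 0),  # alpha-1: E + I -> EI
    (-1, -1, 0, 0, 0, 1),  # beta1:   S + E -> SE
    (1, 1, 0, 0, 0, -1),   # beta-1:  SE -> S + E
]
GAMMA = (0, 1, 0, 1, 0, -1)  # SE -> P + E


def step(state, change):
    """Apply a reaction, or return None if a level would become negative."""
    nxt = tuple(x + d for x, d in zip(state, change))
    return nxt if min(nxt) >= 0 else None


def fast_closure(state):
    """All states reachable from `state` using fast reactions only."""
    seen, queue = {state}, deque([state])
    while queue:
        cur = queue.popleft()
        for change in FAST:
            nxt = step(cur, change)
            if nxt is not None and nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def can_match_gamma(state):
    """True if some sequence of fast steps can be followed by a gamma step."""
    return any(step(s, GAMMA) is not None for s in fast_closure(state))


def check(n, m, p):
    """Check gamma matching and invariance of P under fast steps (m >= 1)."""
    for k in range(n + 1):
        for j in range(min(m, n - k) + 1):
            for l in range(min(p, m - j) + 1):
                big = (n - (k + j), m - (j + l), p - l, k, l, j)
                simple_can_move = n - k >= 1  # assumed enabling condition in Sys'
                if simple_can_move != can_match_gamma(big):
                    return False
                if any(s[3] != k for s in fast_closure(big)):
                    return False
    return True
```

This only checks that a matching move exists and that fast steps leave the product level unchanged. It is a sanity check for chosen values, not a replacement for the proof.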

An example of the first subcase is illustrated in Figure 3 in the unmodified transition system. Consider
the pair of states $((2,3,0,3,0,0),(2,3,0,3)) \in \mathcal{R}$. The $\gamma$-transition from (2,3,0,3) to (1,3,0,4) is
matched by a fast transition from (2,3,0,3,0,0) to (1,2,0,3,0,1) and a $\gamma$-transition from the latter to
(1,3,0,4,0,0), and $((1,3,0,4,0,0),(1,3,0,4)) \in \mathcal{R}$.

To conclude, we have shown for $\{\alpha_1,\alpha_{-1},\beta_1,\beta_{-1}\} \subseteq \mathcal{A}_f$ and $\gamma \in \mathcal{A}_s$ that, for the models Sys and Sys',
$(n,m,p,0,0,0)$ and $(n,m,p,0)$ are fast-slow bisimilar for all positive n, m and p, which covers all major cases of interest. Hence,
we can conclude that the simpler model demonstrates the same behaviour (at a semi-quantitative level)
as the more complex model when we abstract from fast reactions. We can apply the congruence result:
if P is a Bio-PEPA model with no fast reactions in $\{\alpha_1,\alpha_{-1},\beta_1,\beta_{-1}\}$, then since Sys and Sys' are fast-slow bisimilar, we know
that $P \bowtie Sys$ and $P \bowtie Sys'$ are also fast-slow bisimilar. This allows us to build new systems, and also to replace the larger state
space of Sys with the smaller one of Sys'.

8 Related and further work 

Various approaches to modelling biological systems using process algebra have been proposed including
κ-calculus [?], π-calculus [?], Beta-binders [?], Bio-Ambients [?], sCCP [?] and the continuous
π-calculus [26]. Most of these approaches use stochastic simulation as their analysis tool, and
very few approaches have considered the use of semantic equivalences. Both weak bisimulation and
context bisimulation are shown to be congruences for the bio-κ-calculus. Context bisimulation allows
for the modelling of cell interaction [27]. Observational equivalence has been used to show that CCS
specifications of elements of lactose operon regulation have the same behaviour as more detailed models
[?]. In an example of biological modelling using hybrid systems, bisimulation is used to quotient the
state space with respect to a subset of variables as a technique for state space reduction [1]. Bisimulation
has also been used in the comparison of ambient-style models and membrane-style models [?] and the
comparison of a term-rewriting calculus and a simple brane calculus [?]. Other equivalences have been
defined for Bio-PEPA. Compression bisimilarity is based on the idea that different discretisations of a
system should have the same behaviour assuming sufficient levels [20]. Strong and weak bisimulation
parameterised by functions have also been defined [19] and their use demonstrated on a model with
alternative pathways. Further work is to determine whether fast-slow bisimilarity can be expressed as
g-bisimilarity.

Although fast-slow bisimilarity is defined in the context of Bio-PEPA, it is applicable to any formal- 
ism with the same style of stratification of molecular counts or discretisation of concentrations, such as 
the Petri net-based modelling framework of Heiner et al. [23].

QSSA has also been applied to stochastic simulation [21], either to obtain approximate rates [?] or, in
the case of slow-scale stochastic simulation [9, 10], to identify slow and fast species, which then leads to
the introduction of a virtual fast process representing the fast species where slow reactions are removed.

As mentioned earlier, QSSA is a time scale separation technique. There are other variants such as
tQSSA, which considers the total substrate (both free and bound) in deriving reduced equations and is
applicable when $S_0 + K_m \gg E_t$ does not hold [?]. QSSA approaches have been formalised by singular
perturbation theory [?, 38].

Another form of time scale decomposition/separation considers CTMCs and is based on a
decomposition/aggregation technique for solving for steady state. In a nearly completely decomposable CTMC,
the values in the diagonal blocks are much larger (at least one order of magnitude) than those in the
off-diagonal blocks [?]. Hence there are blocks of states where transitions between states within a block are
much more frequent than transitions between states in different blocks. The technique involves solving
for steady state for each block (ignoring transitions to other blocks). Each block is then considered as
a single state, transition rates between these states are computed, and the aggregate CTMC
constructed is solved. Finally, the solutions for each block and the aggregate CTMC are combined to obtain
an approximate solution for the original CTMC.
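
The steps of this decomposition/aggregation technique can be sketched on a toy example. The Python code below uses a hypothetical four-state CTMC with two blocks of two states each; all rates are invented for illustration, and the closed-form solution in `two_state_steady` only applies to two-state chains. It is not drawn from the cited work.

```python
from fractions import Fraction as F


def two_state_steady(rate_ab, rate_ba):
    """Steady-state probabilities (of a, of b) for a two-state CTMC a <-> b."""
    total = rate_ab + rate_ba
    return (rate_ba / total, rate_ab / total)


# Hypothetical rates: fast inside each block, slow between blocks.
blocks = {"A": (0, 1), "B": (2, 3)}
within = {
    "A": (F(10), F(20)),  # 0 -> 1 at rate 10, 1 -> 0 at rate 20
    "B": (F(30), F(10)),  # 2 -> 3 at rate 30, 3 -> 2 at rate 10
}
cross = {(1, 2): F(1), (3, 0): F(2)}  # slow transitions between blocks

# Step 1: solve each block in isolation, ignoring transitions to other blocks.
local = {name: two_state_steady(*within[name]) for name in blocks}


# Step 2: aggregate rate between blocks, weighted by the local solutions.
def aggregate_rate(src, dst):
    return sum(
        local[src][blocks[src].index(i)] * rate
        for (i, j), rate in cross.items()
        if i in blocks[src] and j in blocks[dst]
    )


# Step 3: solve the aggregate CTMC, where each block is a single state.
agg_a, agg_b = two_state_steady(aggregate_rate("A", "B"), aggregate_rate("B", "A"))
agg = {"A": agg_a, "B": agg_b}

# Step 4: combine block and aggregate solutions into an approximate solution.
approx = {
    state: agg[name] * local[name][idx]
    for name, states in blocks.items()
    for idx, state in enumerate(states)
}
```

Exact fractions are used so that the approximate probabilities can be seen to sum to one. How close they are to the true steady state depends on how well separated the fast and slow rates are.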

This technique has been applied to both stochastic Petri nets [4] and stochastic process algebra [25].
For Petri nets, a function is defined over markings to determine which markings are similar and must 
take into account relative rates. In the case of stochastic process algebra, an analysis of processes and 
the rates of the actions they enable is the starting point for identifying subsets of states. Sequential 
components are categorised as fast, slow and hybrid, and states are grouped when they have the same 
slow subcomponents. The passive rate can be used to split hybrid processes into two sequential processes 
with the same behaviour. A time scale decomposition technique for transient analysis [6] is also relevant 
because our model considers transient behaviour as well as steady state behaviour. Since we are not 
working fully quantitatively here, these approaches are issues for further research. Specifically, we wish 
to compare the application of the technique for nearly completely decomposable CTMCs with a QSSA- 
based quantitative equivalence, as well as considering transient behaviour. 

Quantitative equivalences have been defined for CTMCs based on Kripke structures, hence with 
unlabelled transitions and labelled states [2]. Both weak bisimulation and weak simulation are defined. 
Further research involves applying these equivalences, after suitable modification, to CTMCs obtained
from labelled transition systems and seeing their relationship with the QSSA.

Most previous CTMC research assumes fixed rates; however, with Bio-PEPA rates are state-dependent 
which introduces additional complexity. 

9 Conclusion 

We have developed fast-slow bisimilarity, a semi-quantitative semantic equivalence motivated by the 
Quasi-Steady-State Assumption. We show that for two operators of interest, fast-slow bisimilarity is 
a congruence. For the cooperation operator, a reasonable condition is required to ensure congruence. 
The second operator is an extension operator which allows a species to be extended with new reactive 
capabilities. Although the definition of fast-slow bisimilarity is similar to that of weak bisimilarity, 
the condition for congruence for cooperation illustrates how they differ. For certain types of reduced
models, it is possible to work with slow bisimilarity which only considers slow reactions. The use of 
fast-slow bisimilarity is illustrated with an example of competitive inhibition, where one system includes 
the intermediate compounds and the other does not. 

This equivalence can be used to show that a reduced system has the same behaviour as the full system. 
This means it is possible to work only with the reduced system, thereby reducing the number of param- 
eters that need to be fitted. Fast-slow bisimilarity can be applied in any context where concentrations are 
discretised or molecule counts are grouped. 

Further work includes a fully quantitative equivalence, automation of the bisimulation technique
including variable reduction, and investigation of dynamically changing the sets of fast and slow reactions.